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DETAILED ACTION 

Specification 

1 . The disclosure is objected to because of the following informalities: On 
paragraph [0030] for X 2 it says "Index in the frequency axis (1 ... 128 of the first 
estimated formant." This sentence is missing the closing parenthesis after the number 
128 and also it should say second estimated formant instead of first. Appropriate 
correction is required. 

Claim Rejections - 35 USC §112 

2. The following is a quotation of the second paragraph of 35 U.S.C. 112: 

The specification shall conclude with one or more claims particularly pointing out and distinctly 
claiming the subject matter which the applicant regards as his invention. 

3. Claims 4 and 8 are rejected under 35 U.S.C. 112, second paragraph, as being 
indefinite for failing to particularly point out and distinctly claim the subject matter which 
applicant regards as the invention. 

Claim 4 recites the limitation "said threshold value" in line 1 . There is insufficient 
antecedent basis for this limitation in the claim. However, it appears to be dependent 
from claim 2 and has been treated as such for the purposes of examination. Affirmation 
of this is required by the appropriate amendment. 

Claim 8 recites the limitation "said inverse filtering" in line 1. There is insufficient 
antecedent basis for this limitation in the claim. However, it appears to be dependent 
from either claim 6 or 7. This claim will be treated as being dependent from claim 6 for 
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the purposes of examination. Affirmation of this is required by the appropriate 
amendment. 

Claim Rejections - 35 USC § 102 

4. The following is a quotation of the appropriate paragraphs of 35 U.S.C. 102 that 
form the basis for the rejections under this section made in this Office action: 

A person shall be entitled to a patent unless - 

(b) the invention was patented or described in a printed publication in this or a foreign country or in public 
use or on sale in this country, more than one year prior to the date of application for patent in the United 
States. 

(e) the invention was described in (1) an application for patent, published under section 122(b), by 
another filed in the United States before the invention by the applicant for patent or (2) a patent 
granted on an application for patent by another filed in the United States before the invention by the 
applicant for patent, except that an international application filed under the treaty defined in section 
351(a) shall have the effects for purposes of this subsection of an application filed in the United States 
only if the international application designated the United States and was published under Article 21(2) 
of such treaty in the English language. 

5. Claim 1 is rejected under 35 U.S.C. 102(b) as being anticipated by Maura et al. 
(US 2001/0001853). 

As per claim 1 , Maura teaches a method of detecting speech in an incoming 
signal comprising the steps of: 

performing a preprocessing step of extracting a noise estimate of the incoming 
signal to augment signal-to-noise ratio of a speech signal (see paragraph [0034] where 
channel noise energy estimates are provided by the noise energy estimator 214a of Fig. 
2); and 

measuring the periodicity of the incoming signal using an autocorrelation function 
to determine whether a signal frame correspond to a speech frame or not (see 
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paragraph [0039] where the rate decision element 212 of Fig. 2 for determining the 
presence of speech, may be substituted by a normalized autocorrelation function 
(NACF) which measures periodicity in the speech frame). 



Claim Rejections - 35 USC § 103 

6. The following is a quotation of 35 U.S.C. 103(a) which forms the basis for all 
obviousness rejections set forth in this Office action: 

(a) A patent may not be obtained though the invention is not identically disclosed or described as set 
forth in section 102 of this title, if the differences between the subject matter sought to be patented and 
the prior art are such that the subject matter as a whole would have been obvious at the time the 
invention was made to a person having ordinary skill in the art to which said subject matter pertains. 
Patentability shall not be negatived by the manner in which the invention was made. 

7. Claims 2 and 3 are rejected under 35 U.S.C. 103(a) as being obvious over Maura 
et al. (US 2001/0001853) in view of Gong et al. (US Patent 6,980,950). 

The applied reference has a common inventor with the instant application. 
Based upon the earlier effective U.S. filing date of the reference, it constitutes prior art 
only under 35 U.S.C. 102(e). This rejection under 35 U.S.C. 103(a) might be overcome 
by: (1 ) a showing under 37 CFR 1 .132 that any invention disclosed but not claimed in 
the reference was derived from the inventor of this application and is thus not an 
invention "by another"; (2) a showing of a date of invention for the claimed subject 
matter of the application which corresponds to subject matter disclosed but not claimed 
in the reference, prior to the effective U.S. filing date of the reference under 37 CFR 
1.131; or (3) an oath or declaration under 37 CFR 1.130 stating that the application and 
reference are currently owned by the same party and that the inventor named in the 
application is the prior inventor under 35 U.S.C. 104, together with a terminal disclaimer 
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in accordance with 37 CFR 1 .321 (c). This rejection might also be overcome by showing 
that the reference is disqualified under 35 U.S.C. 103(c) as prior art in a rejection under 
35 U.S.C. 103(a). See MPEP § 706.02(l)(1) and § 706.02(l)(2). 

As per claim 2, Mauro et al. teaches the method of claim 1 , but doesn't disclose 
that said periodicity measurement is defined as: 

T h 

p = max Rjc(t) 
T, 

where r„ and T, are pre-specified so that the period will range in the range of 
speech and the signal is speech if pis above a given threshold. Gong et al. teaches the 
periodicity measurement (equation (1 1) on Col. 4). It would have been obvious to one 
having ordinary skill in the art to use the periodicity measurement as taught by Gong et 
al. for Mauro et al.'s method for detecting speech because Gong et al. provides an 
utterance detector for speech recognition. 

As per claim 3, Mauro et al. as modified by Gong et al. teach the method of claim 
2, wherein said period is between about 75 Hz and 400 Hz (Gong et al.'s Col. 4, lines 9- 
10). 

8. Claim 4 is rejected under 35 U.S.C. 103(a) as being unpatentable over Mauro et 
al. (US 2001/0001853) as modified by Gong et al. (US Patent 6,980,950) as applied to 
claim 2 above, and further in view of Wu et al. (US Patent 6,473,735). 

The applied reference has a common inventor with the instant application. 
Based upon the earlier effective U.S. filing date of the reference, it constitutes prior art 
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only under 35 U.S.C. 102(e). This rejection under 35 U.S.C. 103(a) might be overcome 
by: (1) a showing under 37 CFR 1 .132 that any invention disclosed but not claimed in 
the reference was derived from the inventor of this application and is thus not an 
invention "by another"; (2) a showing of a date of invention for the claimed subject 
matter of the application which corresponds to subject matter disclosed but not claimed 
in the reference, prior to the effective U.S. filing date of the reference under 37 CFR 
1.131; or (3) an oath or declaration under 37 CFR 1.130 stating that the application and 
reference are currently owned by the same party and that the inventor named in the 
application is the prior inventor under 35 U.S.C. 104, together with a terminal disclaimer 
in accordance with 37 CFR 1.321(c). This rejection might also be overcome by showing 
that the reference is disqualified under 35 U.S.C. 103(c) as prior art in a rejection under 
35 U.S.C. 103(a). See MPEP § 706.02(l)(1) and § 706.02(l)(2). 

As per claim 4, Maura et al. as modified by Gong et al. teach the method of claim 
2, but they don't disclose said threshold value to be set to maximize speech detection 
accuracy. Wu et al. teach that when speech verifier utilizes the minimum differential 
scores as the threshold values, recognition accuracy is maximized (Col. 8, lines 63-66). 
It would have been obvious to one having ordinary skill in the art to use the feature of 
setting the threshold value to maximize speech detection as taught by Wu et al. for 
Maura et al.'s method for detecting speech as modified above because Wu et al. 
provides a method for speech verification using a confidence measure. 



Application/Control Number: 10/649,270 Page 7 

Art Unit: 2609 

9. Claim 5 is rejected under 35 U.S.C. 103(a) as being unpatentable over Mauro et 

al. (US 2001/0001853) as applied to claim 1 above, and further in view of Focht (US 

Patent 3,603,738) and Graumann (US Patent 6,175,634). 

As per claim 5, Mauro et al. teach the method of claim 1 , but doesn't specifically 

disclose said extracting step including the steps of: 

converting the spectrum of the incoming signal into logarithmic domain, 
removing high frequency components in logarithmic domain by recurrent filtering 

along the time axis, 

establishing an estimate of noise background, converting the estimate into linear 
domain, and 

suppressing the noise background from the signal, in linear domain. 
Focht teaches a circuitry where a speech input is supplied to a logarithmic amplifier and 
a low pass filter for removing the high frequency components (logarithmic amplifier 52 
and low pass filter 54 of Fig. 7, also Col. 7, lines 56-57 and lines 69-73). It would have 
been obvious to one having ordinary skill in the art to use the features of a logarithmic 
amplifier and low pass filter as taught by Focht for Mauro et al.'s method for detecting 
speech because Focht provides means and methods for generating a plurality of 
parameters representative of a speech wave. 

Graumann teaches a noise suppressor that includes a frame energy estimator, an 
attack and decay throttle, an energy mapper, a log-to-linear converter, and an 
attenuator that attenuates the input signal according to a signal received from the log-to- 
linear converter (noise suppressor 260, log-to-linear converter 275, and attenuator 276 
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from Fig. 17, also Col. 11, lines 1-23 describe the process for the suppression of the 
noise background or attenuation of the signal). It would have been obvious to one 
having ordinary skill in the art to use the feature of a noise suppressor as taught by 
Graumann for Mauro et al.'s method for detecting speech as modified by Focht above 
because Graumann provides a noise suppression solution which improves the overall 
quality of transmitted audio. 

10. Claims 6-9 are rejected under 35 U.S.C. 103(a) as being unpatentable over 
Mauro et al. (US 2001/0001853) as modified by Focht (US Patent 3,603,738) and 
Graumann (US Patent 6,175,634) as applied to claim 5 above, and further in view of 
Markel (Digital Inverse Filtering - A New Tool for Formant Trajectory Estimation, June 
1972). 

As per claim 6, Mauro et al. as modified above teach the method according to 
claim 5, but fail to specifically disclose including the preprocessing step of spectral 
inverse filtering. Markel teaches a method for spectral inverse filtering (left column of 
page 132, at the end of the 4 th paragraph, "inverse filter is designed to transform the 
input spectrum into a white noise or constant spectrum../', also Fig. 4 (D) shows the 
analysis result of the inverse filter). It would have been obvious to one having ordinary 
skill in the art to use the feature of an inverse filter as taught by Markel for Mauro et al.'s 
method for detecting speech as modified above because Markel provides an inverse 
filter useful for estimating resonance or formant structure of voiced speech. 
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As per claim 7, Mauro et al. as modified above teach the method according to 
claim 6, wherein said inverse filtering is based on a normalized approximation of the 
envelope of the short term speech spectrum derived from a local maxima of the short 
term speech spectrum (In Markers part IV (The Inverse Filter Algorithm) on page 133, 
3 rd paragraph on the right, "the magnitude spectrum corresponding to the estimate of 
the input spectrum is computed. The magnitude spectrum is scanned and the L k local 
maxima p k (l), 1=1, 2, . . L k in frame k are recorded. The set of all local maxima defines 
the raw data from which the formant trajectories are to be estimated". Also on part IV„ 
page 131, 3 rd paragraph on the right, "since speech is a continually time-varying 
process, short-term analysis of sets of contiguous data samples at some specified 
frame rate is needed".) 

As per claim 8, Mauro et al. as modified above teach the method according to 
claim 6, wherein said inverse filtering is performed ina log frequency domain and is 
implemented by subtracting from the original spectrum the estimated inverse filtering 
spectrum (In Markers part III (Formant Extraction and the Inverse Filter) on page 130, 
where Fig. 1 (A) shows the representative spectrum, Fig. 1 (C) shows the estimated 
inverse filtering spectrum and Fig. 1 (D) shows the spectrum of the inverse filter output. 
From the figures it can be appreciated that the filtering is performed in a log frequency 
domain and that the inverse filter output of Fig. 1 (D) is obtained from subtracting the 
spectrum of Fig. 1 (C) from the original spectrum of Fig. 1 (A)). 

As per claim 9, Mauro et al. as modified above teach the method according to 
claim 6, including spectral reshaping to create an artificial valley in the spectrum before 
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inverse filtering if the original spectrum has two formants close together and low in the 
speech spectrum (In Markers part IV (The Inverse Filter Algorithm), on page 132, 3 rd 
paragraph on the left, for the spectrum of Fig. 4 (A) "it is quite clear that there are two 
closely spaced low formants and two additional formants", a Hamming window is 
applied to the original spectrum of Fig. 4 (A) in order to obtain the spectral 
representation of Fig. 4 (C)). 

Conclusion 

Any inquiry concerning this communication or earlier communications from the 
examiner should be directed to Natalie Lennox whose telephone number is (571) 270- 
1649. The examiner can normally be reached on Monday to Friday 7:30 am - 5:00 pm 
(EST). 

If attempts to reach the examiner by telephone are unsuccessful, the examiner's 
supervisor, Xiao Wu can be reached on (571) 272-7761. The fax phone number for the 
organization where this application or proceeding is assigned is 571-273-8300. 
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Information regarding the status of an application may be obtained from the 
Patent Application Information Retrieval (PAIR) system. Status information for 
published applications may be obtained from either Private PAIR or Public PAIR. 
Status information for unpublished applications is available through Private PAIR only. 
For more information about the PAIR system, see http://pair-direct.uspto.gov. Should 
you have questions on access to the Private PAIR system, contact the Electronic 
Business Center (EBC) at 866-217-9197 (toll-free). If you would like assistance from a 
USPTO Customer Service Representative or access to the automated information 
system, call 800-786-91 99 (IN USA OR CANADA) or 571 -272-1 000. 

NL 2/22/2007 
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